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SPECIFICATION 



SKIP LIST DATA STORAGE DURING COMPILATION 



FIELD OF THE INVENTION 



The invention relates to optimizing compilers 

and methods of compiling. More particularly, the 

invention relates to optimizing routines used in 
compiling which require a data flow analysis. 

BACKGROUND OF THE INVENTION 

Compilers are generally used to transform one 
representation of a computer program procedure into 
another representation. Typically, but not exclusively, 
compilers are used to transform a human readable form of 
a program such as source code into a machine readable 
form such as object code. 

One type of compiler is an optimizing compiler 
which includes an optimizer or optimizing module for 
enhancing the performance of the machine readable 
representation of a program. Some optimizing compilers 
are separate from a primary compiler, while others are 
built into a primary compiler to form a multi-pass 
compiler. Both types of compilers may operate either on 
a human readable form, a machine readable form, or any 
intermediate representation between these forms. 

Many optimizing modules of compilers operate 
on intermediate representations of computer programs or 
procedures. Typically a program or procedure being 
translated is broken down into a series of "statements", 
each of which contains zero or more "operands" or "data 
items". A data item may be "defined", meaning that it 
is given a value by the statement, or "used", meaning 
that its value is fed into the computation represented 
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by the statement. For example, the statement "x = y + 
z" defines x and uses y and z. Optimization of a 
program often involves locating individual statements or 
groups of statements which can be eliminated or 
rewritten in such a way as to reduce the total number of 
statements in the program or in a particular flow path 
through the program. For example, a complicated 
expression might be computed at two distant points 
within the same procedure. If the variables used in the 
expression are not modified to contain different values 
between the first and second computations, the value can 
be computed only once, at the first point in the 
procedure, and saved in a temporary location for use at 
the second point in the procedure, thus avoiding 
recomputation at the second point. This particular form 
of optimization is known as "common (sub) expression 
elimination" . 

The main problem in optimizing a procedure is 
to determine at which points of the procedure various 
kinds of information are available. For example, to 
perform common ( sub) expression elimination, it is 
necessary to know at which points the variables used by 
the procedure are modified. To determine such facts, a 
dataflow analysis is performed on the program. 

To perform dataflow analysis, possible paths 
of execution through a procedure may be represented by a 
control flow graph (CFG) . Statements may be grouped 
together into basic blocks, which are maximal sequences 
of straight-line code. In other words, there is no way 
to branch into or out of a basic block except at the 
beginning or end. A CFG is a graph with one node for 
each basic block in the procedure. The CFG includes an 
arc from block A to block B if it is possible for block 
B to be executed immediately after block A has been 
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executed. In such a case, B is called a '^successor" of 
A, and A is called a ^'predecessor" of B. 

The CFG is generated by a forward pass through 
the procedure to identify basic blocks and transitions 
between basic blocks, and form an ordered representation 
of those blocks and the branches between blocks. One 
well-known approach for ordering the blocks in the CFG 
is to form a ''depth first" ordering of the basic blocks 
of the program. This approach is described in Alfred V. 
Aho, Ravi Sethi, and Jeffrey D, Ullman, Compilers : 
Principles. Techniques, and Tools . Addison-Wesley, 
copyright 1986, reprinted 1988, which is incorporated by 
reference herein, particularly in sections 10,6 and 
10.9. In a depth first ordering, each basic block is 
assigned a "dfo" number, with the following property: if 
every path from the start of the program to block Y must 
pass through block X, then the dfo number for X is less 
than the dfo number for Y, which is written 
dfo (X) < dfo (Y) . 

After generating a CFG, optimization typically 
involves computing various properties at points of 
interest in the procedure, for example, the properties 
of the statements in each block in the CFG. Often, a 
matrix of binary values (bits) such as is shown in Fig. 
1, is used to identify these properties. In a typical 
approach, there are several rows 10 in the matrix for 
each block in the program, each row 10 representing one 
property of the statements in the block. There is one 
column 12 in the matrix for each property of interest 
during optimization. At each row and column location, 
there is a bit which has either a "1" or a "0" value. 

For example, in the matrix shown in Fig. 1, 
each block B is associated with four rows 10, a row 
in[B] for identifying expressions that are available 
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upon entry to block B, a row out[B] for identifying 
expressions that are available upon exit from block B, a 
row gen[B] for identifying expressions that are 
generated by statements in block B, and a row killtB] 
for identifying expressions whose constituent variables 
are modified by statements in block B. The columns 12 
in the matrix relate to particular expressions, numbered 
1, 2, 3 etc. Thus, the located in the row for inEBg] 

and the column for expression 6, indicates that 
expression 6 is available upon entry to block Bg; the '^0" 
located in the row for in[Bi] and the column for 
expression 2, indicates that expression 2 is not 
available upon entry to block Bi. 

A difficulty that arises with the 
representation shown in Fig. 1, is that in practice, 
most of the bits in the matrix are zero (i.e., the 
matrix is "sparse") . In a typical case where the bits 
in the matrix relate to the status of particular 
expressions, the matrix is typically sparse because, 
normally, specific expressions are only used or useful 
in a small portion of a procedure. A large, sparse bit 
matrix not only consumes large quantities of space, but 
also requires a large amount of time to repeatedly scan 
in the manner needed for complex dataflow analysis. 
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SUMMARY OF THE INVENTION 



The invention addresses these and other 
problems associated with the prior art by utilizing a 
skip-list data structure for representing properties of 
points of interest in a procedure. When sets thus 
represented are sparse, this data structure 
substantially reduces the storage space required for 
storing such properties, and can be scanned much more 
rapidly than the corresponding sparse bit matrix.. 

Specifically, in one aspect, the invention 
features a method of storing properties associated with 
a computer procedure, in a linked list of data storage 
nodes. Each of the nodes in the linked list stores a 
property of the computer procedure, and the nodes are 
ordered in accordance with a predetermined property 
order. When a new property is associated with the 
computer procedure, a data storage node for the property 
is generated and added to the linked list. Each data 
storage node includes a data storage space for storing 
an identifier of a property, and at least a first 
pointer storage space for storing a pointer identifying 
a location of an other data storage node. The data 
storage space of each data storage node stores an 
identifier of the new property for which the node was 
allocated. The first pointer storage space of each data 
storage node stores a pointer identifying a location of 
an other data storage node, specifically, the pointer 
identifies a subsequent node in the linked list. 

In disclosed specific embodiments, the linked 
list includes data storage nodes of either a first 
smaller size or a second larger size; the sizes of the 
data storage nodes are randomly selected. The data 
storage nodes of the second larger size include a second 
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pointer storage space for storing a pointer identifying 
a location of an other data storage node. The pointer 
stored into the second pointer storage space identifies 
the location of the next subsequent data storage node of 
the second larger size . 

In the disclosed specific embodiment, when a 
property is disassociated with the computer procedure, 
the associated data storage node becomes dispensable and 
is deleted. 

In accordance with a second aspect of the 
invention, a special procedure is used to initialize 
skip lists, prior to performing data flow analysis, to 
ensure that the skip list structure is not used in an 
inefficient manner as a result of initialization, 
particularly where an iterative dataflow analysis 
technique is used in which iterations of the analysis do 
not increase the membership of the property sets. 
Specifically, each basic block of a computer procedure 
is associated with sets of entry properties and exit 
properties, as well as property modifications caused by 
the basic block. The basic blocks are then selected and 
initialized in a predetermined order. The entry 
properties of the currently selected basic block, are 
copied from exit properties of a previously selected and 
processed basic block. Next, the exit properties for 
the currently selected basic block, are computed from 
the entry properties and the property modifications 
associated with the currently selected basic block. 

In the disclosed specific embodiment, the 
entry and exit properties are sets of expressions 
available upon entry and exit from the basic block, and 
the property modifications are expressions generated and 
killed by the basic block. Also, to further improve the 
initialization of sets for a basic block, any 
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expressions not found in the sets of expressions 
available upon exit from all previously selected and 
processed control flow predecessors of the currently 
selected basic block, are removed from the set of ■ 
expressions available upon entry to the- currently 
selected basic block. 

In a third aspect, the invention features the 
structure of the linked list per se. Specifically, this 
structure includes a plurality of data storage nodes, 
each data storage node including a data storage space 
for storing an identifier of a property and at least a 
first pointer storage space for storing a pointer 
identifying a location of an other data storage node. 
The data storage space of each node stores an identifier 
of a first property for which the node was allocated, 
and the first pointer storage space of each data storage 
node stores a pointer identifying a location of an other 
data storage node, the other data storage node storing a 
second property associated with the computer procedure 
that is subsequent to the first property in a 
predetermined property order. 

In a further aspect, the invention features a 
computer system for compiling a computer procedure into 
a machine-readable representation, comprising an 
optimizer that optimizes the computer procedure into an 
optimized representation by storing and manipulating 
properties associated with the computer procedure in 
accordance with the aspects described above. 

In still a further aspect, the invention 
features a program product configured to store 
properties associated with a computer procedure, in a 
linked list of data storage nodes in accordance with the 
aspects described above, and a signal bearing media 
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8 

bearing the program, which may be a transmission type 
media or a recordable media. 

These and other advantages and features, which 
characterize the invention, are set forth in the claims 
annexed hereto and forming a further part hereof. 
However, for a better understanding of the invention, 
and the advantages and objectives attained by its use, 
reference should be made to the Drawing, and to the 
accompanying descriptive matter, in which there is 
described embodiments of the invention. 



Page 8 of 39 
IBM R0997-127 

WH&E IBM/ 09 

Patent Application 



9 



BRIEF DESCRIPTION OF THE DRAWINGS 

FIGURE 1 is a data structure diagram 
illustrating the use in the prior art of a bit matrix to 
5 represent properties of points of interest in a computer 
procedure . 

FIGURE 2 is a block diagram of a computer 
system consistent with the invention. 

FIGURES 3A and 3B are data structure diagrams 
10 illustrating skip list data structures representing the 
information found in two rows of the bit matrix of Fig, 
2 . 

FIGURE 4 is a flow chart of specific 
operations performed as part of a data flow analysis 
15 using skip list data structures such as shown in Figs. 
3A and 3B. 

FIGURE 5 is a flow chart of specific 
operations performed as part of initializing skip list 
data structures such as shown in Fig. 3A and 3B, for the 
20 data flow analysis process of Fig. 4. 

FIGURE 6A is a flow chart of specific 
operations performed as part of simultaneously scanning 
two skip lists and removing from one list any nodes not 
in the other, as part of the data flow analysis and 
25 initialization processes of Figs. 4 and 5. 

FIGURE 6B is a flow chart of specific 
operations performed as part of simultaneously scanning 
three skip lists and removing from one list any nodes 
not in one of the others, as part of the data flow 
30 analysis process of Fig. 4. 

FIGURE 6C is a flow chart of specific 
operations performed as part of simultaneously scanning 
three skip lists and adding to one list any nodes 
meeting certain conditions in the other two lists. 
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DETAILED DESCRIPTION 

Prior to discussing the operation of 
embodiments of the invention, a brief overview 
discussion of compilers and compiling techniques is 
provided herein. 

Overview of Compilers 

Compilers and the like are generally known in 
the art. One known type of compiler is a multi-pass 
optimizing compiler, which includes a front-end module 
for converting source code into an intermediate 
representation, and a back-end module which takes the 
intermediate representation and generates object code. 

The front-end module of a multi-pass 
optimizing compiler typically includes a lexicographic 
analyzer which identifies tokens or keywords in the 
source code, and a parser which analyzes the program 
statement by statement. The parser typically uses a 
context-free grammar to determine if program statements 
satisfy a set of grammar rules, and builds constructs. 
The parser then generates an intermediate representation 
using an intermediate code generator. 

The back-end module of a multi-pass optimizing 
compiler typically includes an optimizer or optimizing 
module which operates on the intermediate representation 
to generate a revised or optimized intermediate 
representation. Several different optimizations may be 
performed, including but not limited to local 
optimizations such as value numbering, elimination of 
redundant computations, register allocation and 
assignment, instruction scheduling to match specific 
machine characteristics, moving invariant code out of 
loops, strength reduction, induction variable 
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elimination, and copy propagation, among others. The 
back-end module also includes a final code generator to 
generate the object code from the revised intermediate 
representation , 

5 A compiler may reside within the memory of the 

computer system upon which the object code generated by 
the compiler is executed. Alternatively, a compiler may 
be a cross-compiler which resides on one computer system 
to generate object code for execution on another 

10 computer system. Either type of compiler may be used 
consistent with the invention. 

One suitable back-end module for use with the 
invention is an AS/400 optimizing translator supplied 
with an AS/400 minicomputer, which is a common back-end 

15 module of an optimizing compiler. This product may be 
used with a front-end module such as the ILE C Compiler 
available from IBM, among others. It will be 
appreciated that other compilers are suitable for 
different languages and/or different hardware platforms, 

20 and may also be used in the alternative-. 



Computer System 

Turning to the Drawing, wherein like numbers 
denote like parts throughout the several views. Fig. 2 

25 shows a block diagram of a computer system 20 consistent 
with the invention. Computer system 20 is an IBM AS/400 
minicomputer. However, those skilled in the art will 
appreciate that the mechanisms and apparatus consistent 
with the invention apply equally to any computer system, 

30 regardless of whether the computer system is a 

complicated multi-user computing apparatus or a single 
user device such as a personal computer or workstation. 
As shown in Fig. 2, computer system 20 includes a- main 
or central processing unit (CPU) 22 connected through a 
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system bus 21 to a main memory 30, a memory controller 
24, an auxiliary storage interface 26, and a terminal 
interface 28. 

Auxiliary storage interface 26 allows computer 
system 20 to store and retrieve information from 
auxiliary storage such as magnetic disk, magnetic tape 
or optical storage devices. Memory controller 24, 
through use of a processor separate from CPU 22, moves 
information between main memory 30, auxiliary storage 
interface 26, and CPU 22. While for the purposes of 
explanation, memory controller 24 is shown as a separate 
entity, those skilled in the art understand that, in 
practice, portions of the function provided by memory 
controller 24 may actually reside in the circuitry 
associated with CPU 22 and main memory 30. Further, 
while memory controller 24 of the embodiment is 
described as having responsibility for moving requested 
information between main memory 30, auxiliary storage 
interface 26 and CPU 22, those skilled in the art will 
appreciate that the mechanisms of the present invention 
apply equally to any storage configuration, regardless 
of the number and type of the storage entities involved. 

Terminal interface 28 allows system 
administrators and computer programmers' to communicate 
with computer system 20, normally through programmable 
workstations. Although the system depicted in Fig. 2 
contains only a single main CPU and a single system bus, 
it will be understood that the invention also applies to 
computer systems having multiple CPUs and buses. 

Main memory 30 is shown storing a compiler 40 
(comprising analyzer 42, parser 44, optimizer 46 and 
code generator 48) and operating system 32. Memory 30 
also includes a workspace 50, which is shown storing a 
computer program in various stages of compilation, 
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including a source code representation 52, an 
intermediate representation 54, an optimized 
representation 56 and object code 58. However, it 
should be understood that main memory 30 will not 
necessarily always contain all parts of all mechanisms 
shown. For example, portions of compiler 40 and 
operating system 32 will typically be loaded into caches 
in CPU 22 to execute, while other files may well be 
stored on magnetic or optical disk storage devices. 
Moreover, the various representations 52-58 of a 
computer program may not be resident in the main memory 
at the same time. Various representations may also be 
created by modifying a prior representation in situ. In 
addition, as discussed above, the front-end and back-end 
modules in some systems may be separate programs. 

It will be appreciated that computer system 20 
is merely an example of one system upon which the 
routines may execute. Further, as innumerable 
alternative system designs may be used, principles of 
the present invention are not limited to any particular 
configuration shown herein. 

In general, the routines executed to implement 
the illustrated embodiments of the invention, whether 
implemented as part of an operating system or a specific 
application, program, object, module or sequence of 
instructions will be referred to herein as "computer 
programs". The computer programs typically comprise 
instructions which, when read and executed by one or 
more processors in the devices or systems in a computer 
system consistent with the invention, cause those 
devices or systems to perform the steps necessary to 
execute steps or generate elements embodying the various 
aspects of the present invention. Moreover, while the 
invention has and hereinafter will be described in the 
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context of fully functioning computer systems, those 
skilled in the art will appreciate that the various 
embodiments of the invention are capable of being 
distributed as a program product in a variety of forms, 
and that the invention applies equally regardless of the 
particular type of signal bearing media used to actually 
carry out the distribution. Examples of signal bearing 
media include but are not limited to recordable type 
media such as volatile and non-volatile memory devices, 
floppy disks, hard disk drives, CD-ROM's,, DVD's, 
magnetic tape, etc., and transmission type media such 
as digital and analog communications links. 
Use of Computer System 

Referring now to Figs. 3A and 3B, an 
explanation can be provided of the skip list data 
structure of the kind used in accordance with principles 
of the present invention. The skip list illustrated in 
Fig. 3A stores the set of properties in[Bi], 
specifically, the set of expressions available upon 
entry to basic block of the computer procedure 
currently undergoing compilation. The skip list 
illustrated in Fig. 3B stores the set of. properties 
InEBg], specifically, the expressions available upon 
entry to basic block B2 of the computer procedure 
currently undergoing compilation. 

As seen in Figs. 3A and 3B, each skip list 
begins with an initial data storage node 60a and 60b, 
and includes a number of data storage nodes 62a/ 62b, 
64a/64b, 66a/66b, 68a/68b, 70a/70b, 72a/72b etc. Each 
node includes a storage space 84 for storing a key for a 
property of the computer procedure. For example, the 
skip list shown in Fig. 3A, which identifies those 
expressions which are available upon entry to basic 
block Bi, includes nodes storing the keys ''3'', ""6", "7", 
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"9", "12'', "17", "19'', "21", "25" and "26", thus 
indicating that the expressions associated with these 
keys are available upon entry to basic block • 

The nodes in a skip list are one of a number 
5 of sizes. Nodes of the smallesf size (which will be 
referred to as "size 1") , such as nodes 62a and 62b, 
include only (a.) a storage space 84 for storing a key 
of a property of the computer procedure, and (b.) a 
storage space 86 for storing a "level 1" pointer 

10 (computer memory address) indicating the location of the 
next node in the skip list. Nodes of the next larger 
size (which will be referred to as "size 2") , such as 
nodes 68a and 64b, include storage spaces 84 and 8 6, and 
in addition, include a storage space 88 for storing a 

15 "level 2" pointer indicating the location of a 

subsequent node in the skip list. Specifically, the 
level 2 pointer in storage space 88 indicates the 
location of the next node in the skip list of at least 
size 2. 

20 The skip lists illustrated in Figs. 3A and 3B 

also include "size 3" nodes (e.g., nodes 78a and 70b) 
which include, in addition to the storage spaces 84, 86 
and 88 found in a size 2 node, an additional pointer 
storage space 90 for storing a "level 3" pointer to the 

25 next node in the skip list of at least size 3, 

Furthermore, the skip list of Fig. 3A includes a "size 
4" node. 64a, which includes, in addition to the storage 
spaces 84, 86, 88 and 90 found in a size 3 node, a 
"level 4" additional pointer storage space 92 for 

30 storing a pointer to the next node in the skip list of 
at least size 4 . 

The maximum number of levels for skip list 
nodes is arbitrary, and may, in one embodiment, be 
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selected based on the criteria identified by the above- 
referenced paper by Pugh. 

Skip lists terminate at a ''NIL" node 82. The 
level 1 pointer of the last node in the skip list points 
5 to the NIL node 82, Furthermore, the level 2 pointer of 
the last node in the skip list of at least size 2, 
points to the NIL node. The level 3 pointer of the last 
node in the skip list of at least size 3, points to the 
NIL node. The level 4 pointer of the last node in the 

10 skip list of at least size 4, points to the NIL node. 

The keys for properties stored in a skip list, 
are generated so that the keys can be sorted in a 
predetermined, defined order, in order to simplify the 
process of performing a membership test of a set 

15 represented by a skip list. In the example below where 
the skip lists store sets of expressions related to the 
computer procedure, a set of keys with a predetermined 
ordering can be created, for example, by numbering each 
.expression as it is encountered during a scan of the 

20 entire program in depth-first order. Then, these 

numbers can be used as the keys in the skip list data 
storage fields 84. To facilitate processing, the key 
for the NIL node is the highest number that can be 
represented in the data format used to store the keys, 

25 so that the key of the NIL node is always numerically 
greater than any other key in a skip list. 

Specific details on the use of skip lists, 
including the process for scanning a skip list to 
perform a membership test, methods for randomly 

30 selecting the size of skip list nodes, and further 

descriptions of the advantages of skip lists, can be 
found in William Pugh, Skip Lists: A Probabilistic 
Alternative to Balanced Trees , Communications of the 



Page 16 of 39 
IBM R0997-127 

WH&E IBM/ 09 
Patent Application 



ACM, June 1990, Vol, 33, Number 6, Pages 668 to 676, 
which is hereby incorporated herein in its entirety. 

Referring now to Fig. 4, the process for 
performing data flow analysis using the data structures 
discussed above can be more completely explained. 
Specifically, for initialization, a loop 100, 102, 104 
is performed. In this loop, for each block B in the CFG 
(step 100), the properties gen[B] and kill [B] are 
computed for the block (step 102), and the next block is 
selected (step 104), until all blocks have been 
processed. This step involves adding a node to the skip 
lists storing gen[B] and kill [B] for each expression to 
be included in those sets. Details on the process for 
adding a node to a skip list can be found from the 
above-referenced paper by Pugh. Details on methods for 
identifying expressions to be included in gen[B] and 
kill[B] as part of step 102 are described in the above- 
referenced section of the Aho et al . book. 

After computing the properties gen[B] and 
kill[B], processing continues to step 106, in which an 
initial approximation of the sets in[B] and out[B] are 
computed for each basic block B. Details on this 
initialization process are set forth below with 
reference to Fig. 5. 

After the appropriate initialization has been 
performed, a loop including steps 108, 110, 112, 114, 
116, 118, 120 and 122 is performed to make a dataflow 
analysis on the program and specifically the basic 
blocks thereof, to obtain final representations of in[B] 
and out[B], which can then be used in subsequent program 
optimizations such as common { sub } expression 
elimination. Each of steps 110, 112, 114, 116 and 118 
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is performed for each block B in the CFG (step 108) 
until all blocks have been processed (step 120) , 

For each block, taken in depth-first order, a 
second loop including 110, 112 and 114 is performed, for 
5 the purpose of eliminating expressions that are not- 
available upon exit of the predecessors of that block. 
Specifically, for each predecessor P of the currently 
selected block B (step 110) , the skip lists storing the 
sets in[B] and out[P] are 'Valked", or scanned, 

10 simultaneously, removing from in[B] any expressions not 
found in out[P] (step 112). Further details on the 
operations performed in connection with step 112 are 
provided below with reference to Fig. 6A, Step 112 is 
repeated for each predecessor P until all predecessors 

15 have been processed (step 114) . 

After completing this loop for each 
predecessor P of the currently selected block B, in step 
116, it is determined whether in[B] was changed as a 
result of steps 110, 112 and 116. (Note that the only 

20 change that might be made to in[B] is the removal of one 
or more members from the set of expressions represented 
by in[B].) If there have been changes to in[B], then, 
processing proceeds to step 118, in which outtB] for the 
same block is updated to reflect the changes to in[B]. 

25 Specifically, the skip lists which store a 

representation of in[B], out[B], and gen[B], are 
simultaneously ''walked", or scanned, to identify those 
nodes of outtB] which should be removed as a result of 
the changes to intB]. Further specific details on this 

30 process will be provided in connection with Fig. 6B . 

After out[B] has been updated- through step 
118, or immediately after step 116 if in[B] has not 
changed, the next block B in the CFG is selected, and 
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processing returns to step 110. This process repeats 
until every block B in the CFG has been processed, at 
which time, processing proceeds to step 122. In step 
122, it is determined whether any of the in[.] or out[.] 
sets, for any of the blocks in the CFG, were changed as 
a result of the previous pass through the blocks in the 
steps of loop 108, 110, 112, 114, 116, 118, 120. If so, 
then another pass through the blocks must be performed, 
and processing returns to step 108. However, if none of 
the in [ . ] and out [ . ] sets have changed in the previous 
pass, then correct final versions of the in[.] and 
out[.] sets have been achieved, and the data flow 
analysis is completed. Subsequently, the sets for in[.] 
and out[.] can be used in various optimizations, such as 
common sub-expression elimination. (Other methods may be 
used to select blocks to process and the order of 
processing, for example, a ""^worklisf approach could be 
used, in which a block is only processed if any of its 
predecessors changed since the last pass.) 

Referring now to Fig. 5, the process for 
calculating an initial approximation can be described in 
greater detail. As is noted above, the process for 
updating the in [ . ] and outt.] sets illustrated in Fig. 
4, updates these sets by removing those members which 
are determined to be unavailable upon entry and upon 
exit from specific blocks. Typically, a dataflow 
analysis process of this kind is initialized by 
including all possible expressions into in[B] and out[B] 
for every block B in the CFG, and then using a dataflow 
analysis to remove expressions in an iterative manner 
analogous to that shown in Fig. 4, until final resulting 
sets of expressions are obtained. 
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While this method of initializing the sets 
in[.] and out [ . ] is effective when these sets are 
represented as bit vectors as is shown in Fig. 1, where 
densely populated sets can be efficiently represented, 
5 this method is not effective when these sets are 

represented as skip lists such as is shown in Figs. 3A 
and 3B, because skip lists are intended for use in 
representing sparsely populated sets and are not 
efficient in representing densely populated sets. 

10 Therefore, in accordance with principles of 

the present invention, a novel methodology is used to 
initialize the sets in [ . ] and out[.]. This methodology 
is based on the recognition that (1) the dataflow 
analysis conducted in accordance with Fig. 4 only 

15 reduces the number of members in these sets, and (2) it 
is only necessary that in [ . ] and out [ . ] initially 
include all members that might be included after 
dataflow analysis. If a simplified analysis of the 
blocks B in the CFG can identify all of the members that 

20 might possibly be included in the in [ . ] and outt.] sets 
after a complete data flow analysis, then only those 
members that might possibly be included need be included 
when the in[.] and out [ . ] sets are initialized. 

To perform an appropriate initialization, 

25 therefore, in accordance with principles of the present 
invention, a single forward pass is made through the 
CFG, initializing the in [ . ] and out [ . ] sets by assuming 
that, in any given block B, all expressions available on 
exit from all control flow predecessors of block B that 

30 appear earlier in the depth-first order than block B, 
will be available in block B. 

Specifically, in a first step 130, sets are 
generated for an initial block Init, which by definition 
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precedes the first block in the CFG. The sets in[Init] 
and out[Init] for the block Init are initialized to be 
empty. Then, beginning at step 132, a loop, including 
steps 134^ through 142 is performed for each block in the 
CFG, taken in depth-first order. 

In the first step 134, the set in[B] for the 
current block B is copied from out[P] for a predecessor 
block P of the block B (where the predecessor P 
satisfies the requirement that df o (P) <df o (B) ) . This 
makes a first approximation for in[B], based on 
knowledge that no expression is available upon entry to 
block B unless it is available on exit from every 
predecessor P of B. Thus, the first approximation of 
in[B] formed by this step will include every expression 
that might be in the final set for in[B] after a 
complete dataflow analysis . 

After computing this initial approximation, 
beginning at step 136, a loop, including step 138, is 
performed for each remaining control flow predecessor P 
of block B which satisfies the requirement that 
df o (P) <df o (B) , Specifically, any expressions in the 
approximation for in[B] which are not available upon 
exit from any qualifying predecessor, are removed from 
in[B], Specifically, in step 138, the skip lists 
representing in[B] and out[B] are simultaneously 
""Valked", or scanned, to remove from in[B] any nodes not 
found in out[P] . Specific details of the operations 
involved in step 138 are detailed below* with reference 
to Fig. 6A. 

At step 140, step 138 is repeated until every 
predecessor P has been considered. The resulting 
iterative loop improves the approximation for in[B] 
formed in step 134 by limiting in[B] to those 
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expressions which are available upon exit from all 
predecessor blocks P appearing prior to B in the depth- 
first ordering. 

After all predecessors P for a block have been 
processed through step 138, processing continues to step 
142, at which an initial approximation of out[B] is 
generated. Specifically, out[B] is approximated from 
in[B] by adding to in[B] all expressions generated by 
block B (as represented by the previously-computed set 
gen[B]), and removing from in[B] any expressions killed 
by block B (as represented by the previously-computed 
set kill [B] ) . To make this approximation, the skip 
lists for in[B], kill [B] and gen[B] are ''walked", or 
scanned, simultaneously, adding to out[B] any nodes 
found in gen[B] and any nodes found in in[B] but not in 
kill [B] . Details on this operation are described below 
in connection with Fig. 6C. 

After step 142, at step 144, control is 
returned to step 134 to process the next block B in the 
CFG taken in depth-first order, until all blocks B have 
been so processed. Once all blocks B have been 
processed, the sets in[B] and out[B] are appropriately 
initialized so that these sets have an initial 
approximation of their final contents which. contains all 
of the members that might be included after a complete 
dataflow analysis according to Fig. 4, without including 
all expressions into these sets and thus incurring 
inefficient storage . 

Referring now to Fig. 6A, details of the 
operations performed in steps 112 (Fig. 4) and 138 (Fig. 
5) to walk the skip lists of in[B] and out[P] 
simultaneously, removing from in[B] any nodes not found 
in out[P]. As a first step in this process, in step 
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148, the first nodes in the skip lists for in[B] and 
out[P] are selected. Next, in step 150 it is determined 
whether the end of the skip list representing in[B] has 
been reached, which is determined by determining whether 
5 the key of the current node of the in[B] skip list is 
the highest integer, which is the key of the ''NIL" node 
82, If so, no further processing is needed and the 
operation is done. 

However, if in step 150 the end of the in[B] 

10 skip list has not been reached, then, in step 152, it is 
determined whether the key in the current node in the 
out[P] skip list, follows the key in the current node in 
the in[B] skip list in the predetermined key order. If 
so, this indicates that the expression represented by 

15 the current node in the in[B] skip list, is not present 
in the out[P] skip list. (Note that if the end of the 
outtP] skip list has been reached, then the key of the 
current node in the out[P] skip list is greater than any 
key in the in[B] skip list, and thus the current and all 

20 subsequent nodes in the in[B] skip list should be 

removed and are removed by repetitions of steps 154, 
156, 150 and 152, until the end of the in[B] skip list 
is reached.) When the current node in the intB] skip 
list is not present in the out[P] skip list, processing 

25 proceeds to step 154, where the current node in the 

in[B] skip list is deleted. (Details on the process for 
deleting a node from a skip list can be found in the 
above-referenced paper by Pugh.) As part of deleting 
the current node in in[B], a flag may be set, so that it 

30 can later be determined (in step 116, Fig. 4) that in[B] 
was changed. After deleting the current node in the 
in[B] skip list, in step 156, the next node in the in[B] 
skip list is selected (by following the level 1 pointer 
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of the current node to the first subsequent node) , and 
processing returns to step 150. 

If, in step 152, the key in the current node 
in the out[P] skip list, does not follow the key in the 
current node in the in[B] skip list in the predetermined 
key order, then processing continues to step 158, In 
step 158, it is determined whether the key in the 
current node in the in[B] skip list is equal to the key 
in the current node in the out[P] skip list. If not, 
then the key in the current node in the out[P] skip list 
must precede the key in the current node in the in[B] 
skip list, and accordingly, processing proceeds to step 
162, where the next expression in the out[P] skip list 
is selected (by following the level 1 pointer of the 
current node) , after which processing returns to step 
150 . 

If, however, in step 158, it is determined 
that the keys in the current nodes in the in[B] and 
out[P] skip lists are equal, then the expression 
represented by the current node in the in[B] skip list 
is in out[P], in which case, the next expression in the 
in[B] and out[P] skip lists should be evaluated. Thus, 
in this case, processing proceeds to step 160. In step 
160, the next node in the in[B] list is selected .(by 
following the level 1 pointer of the current node) , and 
then processing proceeds to step 162, where the next 
node in the out[P] skip list is selected. 

Referring now to Fig. 6B, the processing of 
step 118 (Fig. 4) can be described in more detail. In 
this step, the skip lists for in[B], gen[B] and out[B] 
are walked together, removing from out[B] any nodes not 
in either of in[B] or gen[B]. 
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To begin this process, in step 170 the first 
nodes in the in[B], gen[B] and out[B] nodes are 
selected. Next, in step 172, the current node in the 
out[B] list is evaluated, to determine whether the end 
of the out[B] skip list has been reached (by comparing 
the key for the current out[B] node to the largest 
possible integer, which is used in the NIL node) . If 
the end of the out[B] skip list has been reached, then 
processing is done. 

If the end of the out[B] skip list has not 
been reached, however, then processing proceeds to step 
174, in which the key of the current node in the gen[B] 
skip list is compared to the key of the current node of 
the out[B] skip list. If the key of the current node in 
the out[B] skip list follows the key of. the current node 
in the gen[B] skip list in the predetermined key 
ordering, then processing proceeds to step 176, and the 
next node in the gen[B] skip list is selected (by 
following the level 1 pointer of the current node in the 
gen[B] skip list), and then returns to step 174. This 
small loop including steps 174 and 176 continues to 
select subsequent elements in the gen[B] skip list until 
the key of the current element in gen[B] is either equal 
to or follows the key of the current node in the out[B] 
skip list. Once the key of the current node in the 
gen[B] skip list is either equal to or follows the key 
in the current node in the out[B] skip list, processing 
proceeds to step 178. 

In step 178, the key of the ciarrent node in 
the in[B] skip list is compared to the key of the 
current node of the out[B} skip list. If the key of the 
current node in the out[B] skip list follows the key of 
the current node in the intB] skip list in the 
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predetermined key ordering, then processing proceeds to 
step 180, and the next node in the in[B] skip list is 
selected (by following the level 1 pointer of the 
current node in the in[B] skip list), and then returns 
5 to step 178. This small loop including- steps 178 and 
180 continues to select subsequent elements in the in[B] 
skip list until the key of the current element in in[B] 
is either equal to or follows the key of the current 
node in the out[B] skip list. Once the key of the 

10 current node in the in[B] skip list is either equal to 
or follows the key in the current node in the out[B] 
skip list, processing proceeds directly to step 182. 

In step 182, the key of the current node in 
the out[B] skip list is compared to the keys of the 

15 current nodes in the gen[B] and in[B] skip lists. If 
the key of the current node in the out[B] skip list is 
not equal to either of the keys in the current nodes of 
the gen[B] or in[B] skip lists, then it can be 
determined that the expression represented by the key of 

20 the current node of the out[B] skip list is not in 
either gen[B] or in[B]; therefore, in this case, 
processing proceeds to step 184, and the current node in 
the out[B] skip list is deleted (details on the process 
for deleting a node from a skip list can be found in the 

25 above-referenced paper by Pugh) . After step 184, the 

next element in the out[B] skip list is selected in step 
186, and processing returns to step 172. However, if in 
step 182 the key of the current node in the out[B] skip 
list is equal to either or both of the keys in the 

30 current nodes of the gen[B] or intB] skip lists, then 
the expression represented by the key of the current 
node of the out[B] skip list is in one of gen[B] or 
in[B]; therefore, in this case, processing proceeds to 
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directly to step 186, and the next node in out[B] is 
selected, after which, processing returns to step 172. 

Referring now to Fig, 6C, the processing of 
step 142 (Fig. 5) can be described in more detail. In 
this step, the skip lists for in[B], gen[B] and kill [B] 
are walked together, adding to out[B] any nodes found in 
in[B] but not in kill [B] . 

To begin this process, in step 190 the first 
nodes of in[B], gen[B] and kill[B] are selected. Then, 
in step 192, the keys in the current nodes in in[B] and 
gentB] are evaluated to determine whether these nodes 
are the NIL node (having the highest integer number as a 
key) , If both of the current nodes are the NIL node, 
then processing is done. However, if this is not the 
case, then processing proceeds to step 194, in which the 
key of the current node in the gen[B] skip list is 
compared to the key in the current node in the in[B] 
skip list. If the key in the current node in the in[B] 
skip list follows the key in the current node in the 
gen[B] skip list, this indicates that there is at least 
one expression generated in the current block B, 
represented by a node in gen[B], that is not in in[B]; 
therefore, in this case, processing proceeds to step 
196, where the key in the current gen[B] node is added 
to the end of out[B], thus including the generated 
expression in the set of expressions available upon exit 
from block B; thereafter, in step 198, the next node in 
gen[B] is selected, by following the level 1 pointer of 
the current node in gen[B], and then processing returns 
to step 194. This loop including steps 194, 196 and 198 
is repeated until all of the elements which appear in 
gen[B] and are prior to the current element in in[B], 
have been added to the end of out[B] . (Note that 
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because the present routine is performed during 
initialization of out[B], the set of expressions 
represented by out[B] is initially empty, and so it is 
not necessary to insert nodes into any middle points in 
5 outEB]. This means that the process of adding nodes can 
be fairly rapid. The last node of each size is stored 
in memory as nodes are added. Then, when adding a new 
node to the end of the list of size n, the last node of 
each height less than or equal to n has its pointer (s) 

10 updated to point to the new node. The new node -then 
becomes the last node of size n.) 

Eventually, by proceeding to subsequent nodes 
in the gen[B] skip list, eventually a node is reached in 
gentB] which has a key that does not precede the key of 

15 the current in[B] node in the predetermined key order. 
In this situation, processing proceeds from step 194 to 
step 200, where it is determined whether the current 
node in the in[B] skip list is the NIL node. If so, 
then the end of the in[B] skip list has been reached, 

20 and processing returns directly to step 192. 

However, if the end of the in[B] skip list has 
not been reached, then processing proceeds to step 202, 
in which the key of the current node of the in[B] skip 
list is compared to the key of the current node of the 

25 kill [B] skip list. The purpose of this and the 

following steps 204 and 206 is to determine whether the 
key of the current node of the in[B] skip list should be 
added to the out[B] skip list, and to take appropriate 
actions. If the key of the current node of the kill [B] 

30 skip list precedes the key of the current node of the 
in[B] skip list in the predetermined key order, then 
processing proceeds to step 204, in which the next node 
in the kill[B] skip list is selected (by following the 
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level 1 pointer of the current node of the kill[B] skip 
list), and processing returns to step 202. 

This loop between steps 202 and 204 is 
continued until the key of the current node in the 
5 kill[B] skip list does not precede the key of the 

current node of the in[B] skip list. Once this occurs, 
processing proceeds from step 202 to step 206, in which 
it is determined whether the key of the current node in 
the kill[B] skip list is equal to the key of the current 
10 node in the in[B] skip list. If so, then the key found 
in in[B] is also found in kill [B] ; in such a situation, 
the key should only be added to outtB] if the key also 
appears in gen[B]. Stated another way, an expression 
which is available upon entry to a block, but is killed 
Cn 15 by that block, will not be available upon exit from the 

li block unless the expression is also generated by the 

rLI block. Accordingly, in this situation, processing 

Ji- proceeds to step 212, in which the key of the current 

'=^1 node in the in[B] skip list is compared to the key of 

■/| 20 the current node in the gen[B] skip list. If the keys 

'.3 are equal, then the expression represented by the 

current key in the current node of the in[B] skip list 
is generated by the current block B, and accordingly 
processing continues to step 214, in which a node with 
25 this key is added to the end of the out[B] skip list. 
Thereafter, the next node of the gen[B] skip list is 
selected in step 216, and the next node of the intB] 
skip list is selected in step 218, and processing 
returns to step 192. 
30 If, in step 212, it is determined that the key 

of the current node in the intB] skip list is not equal 
to the key of the current node in the gentB] skip list, 
then the expression represented by the key of the in[B] 
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node is not generated by the current block B. 
Accordingly, in this case, processing proceeds directly 
to step 218 to select the next node in in[B], and then 
returns to step 192. 
5 Furthermore, if in step 206, it is determined 

that the key of the current node in the' in [B] skip list 
is not equal to the key of the current node in the 
kill[B] skip list, then it is determined that the 
expression represented by the key in the current node of 

10 the in[B] skip list is available upon entry to block B 
and not killed by block B. In this case, processing 
proceeds to step 208, in which a node is added to the 
end of out[B] containing the key of the current node of 
in[B], thus effectively adding this expression to 

15 out[B] . Next, in step 210, the key of the current node 
of in[B] is compared to the key of the current node of 
gen[B]. If the keys are equal, then processing proceeds 
to step 216 to select the next node in gen[B] and then 
(in step 218) the next node in in[B] before returning to 

20 step 172. However, if the keys are unequal, then the 

key of the current node of gen[B] must follow the key of 
the current node in in[B], and accordingly processing 
proceeds directly to step 218 so that the next node is 
selected in only in[B] before proceeding to step 192. 

25 Following the foregoing procedures, a skip 

list representation can be used for the sets in a 
dataflow analysis performed by a compiler, and in 
subsequent optimizations, substantially improving the 
efficiency of data storage as compared to the use of bit 

30 vectors . 

It will therefore be appreciated that the 
invention provides significant advantages in terms of 
optimization of computer procedures during compilation. 
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resulting in more efficient compilation. It will also 
be appreciated that numerous modifications may be made 
to the disclosed embodiments consistent with the 
invention. For example, in some programs each block in 
5 the CFG may kill a large volume of the universe of 
expressions. If this is the case, then it may be 
advantageous to represent the sets kill [B] as bit 
vectors in the manner shown in Fig. 1, rather than as 
skip lists. If this approach is taken, then steps 202, 

10 204 and 206 of Fig. 6C would be replaced with a single 
step performing a comparison of the key of the current 
node of in[B] to the bit vector representing kill [B] to 
determine whether the key of the current node of in[B] 
[fi is a member of kill [B] . If so, processing would proceed 

15 to step 212, and if not, processing would proceed to 

V i 

%^ step 208. Other modifications may be made to the 

disclosed embodiments without departing from the spirit 

s and scope of the invention. Therefore, the invention 

lies in the claims hereinafter appended. 

ry 20 

tzi 
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